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Asymptotic frequency of shapes in supercritical 
branching trees 
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Abstract 

The shapes of branching trees have been linked to disease transmission patterns. In this paper we 
use the general Crump-Mode-Jagers branching process to model an outbreak of an infectious disease 
under mild assumptions. Introducing a new class of characteristic functions, we are able to derive 
a formula for the limit of the frequency of the occurrences of a given shape in a general tree. The 
computational challenges concerning the evaluation of this formula are in part overcome using the 
Jumping Chronological Contour Process. We apply the formula to derive the limit of the frequency 
of cherries, pitchforks and double cherries in the constant rate birth-death model, and the frequency 
of cherries under a non-constant death rate. 
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1 Introduction 

Branching processes are widely studied and used to model many biological growth phenomena. Al¬ 
though their first and most direct application has been in the study of evolution and extinction m, 
they have also been employed to model infectious disease epidemics m [28] . In this context each 
branching event represents an infection at which a new infectious individual enters the model. The 
individual’s history corresponds to a path in the tree, beginning at the time of infection and ending 
at a tip which represents the individual’s death or recovery. 

In recent years, improvements in sequencing technologies have made it possible to detect micro¬ 
evolutionary events in pathogens. Pathogen sequence data can be used to infer branching trees which 
in turn can inform our understanding of the disease’s transmission dynamics 29]. The inference 

of trees from sequence data becomes more challenging as more and more isolates (tips in the tree) are 
sequenced, presenting significant challenges for the field. Inference relies on tree likelihoods, which are 
derived from branching processes PEHED]. The shapes of phylogenetic trees have also been linked 
to disease transmission patterns mum- 

The shape of a tree can be defined, informally, as the tree without considering the associated 
branch lengths m- The shape distribution for the Yule model was studied originally in order to 
estimate the branching points of a branching diffusion process m • Similarly, numerous studies can 
be found on tree likelihoods and inference, though most exploit the timing of branching events in 
trees rather than focusing on tree shapes. Moreover, the Yule tree is the most well-studied model, 
for which there are results describing internal structure and shape distribution m CE2 ES !■ Hence 
the shape distribution, at least for the homogeneous (Yule) model, can be considered as resolved. For 
non-homogeneous models however, few results are available in the literature. 

The frequency of a shape in a tree is the ratio between the number of occurrences of that particular 
shape and the number of tips in the tree. Some results regarding shape frequency are available for 
simple tree models [23126115]. In more general settings, the limit of the frequency of a shape will 
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Fig. 1: Explanatory figure that shows the choice of notation in a simple tree generated from a 
CMJ process 


depend on the process defining the tree. Therefore, shape frequencies could provide a tool to estimate 
the governing parameters of a branching process, using trees derived from empirical data. Given the 
intractable number of tree shapes with n tips [T4j ((2n—3) • (2n—1)...3-1), computational approaches to 
finding the frequencies of small shape patterns in large trees can have limited success. This motivates 
an analytical derivation of shape frequencies. 

In this work we use the Crump-Mode-Jagers branching process (CMJ process) [I7J to obtain 
the asymptotic frequencies of tree shapes. The CMJ process is a general model with very mild 
assumptions; it includes as special cases the Yule process and the homogeneous (constant-rate birth- 
death) process. We focus on supercritical trees, whose Malthusian parameter is > 1, because they have 
a positive probability of never reaching extinction. Using novel characteristic functions and previous 
convergence properties of the CMJ [23], we derive a general formula for the asymptotic frequency of 
potentially any shape in any tree. The evaluation of the formula presents computational challenges, 
which we overcome in part by applying the Jumping Chronological Contour Process USEE]. We 
evaluate the expression for the asymptotic frequency of some simple shapes in the homogeneous tree 
and a non-homogeneous model. 

2 Background 

The theory of general Crump-Mode-Jagers (CMJ) branching processes provides an ideal framework 
for the study of sub-shapes because they can be counted with characteristic functions. We begin with 
some definitions and the results we have used from the literature on CMJ processes. More details can 
be found in H3 13- 

Notation and definition 

Following Jagers’ setting EH, each individual of the process is assigned a sequence x in the space I of 
all the possible sequences of non-negative integers. The sequence x is chosen uniquely in the following 
way: if the individual with sequence x is the k -th daughter of the individual with sequence y, then 
x = ( y,k ). Therefore, setting the ancestor’s sequence as the singleton 0, each individual’s unique 
sequence keeps track of its predecessors. Since every sequence starts with a 0, for simplicity we will 
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Fig- 2: An example of a Jumping Chronological Contour Process and its relative tree. 


omit this leading 0 from the notation. 

Each individual x is also assigned a random variable A^, the life length of x, and a point process 
£ x , the reproduction of x. Further, it is assumed that the pairs (X x ,£ x ) are i.i.d. but X x and £ x 
may depend on each other. For the scope of this paper we need to make two restrictions on the 
point process £. First, to have a binary tree, we require that £ has no multiple points. Second, let 
li(t) = [£(£)]: for to have supercriticality of the branching process we require lim^oo /i(t) > 1. 

We refer to the constant-rate birth-death process as the homogeneous process. It is a special case 
of the CMJ process: births happen at a constant rate f3 and deaths at a constant rate 5. In the CMJ 
setting, this corresponds to the point process £ being a Poisson process with intensity (3 over the life 
span A, which is exponentially distributed with parameter 6. 

To complete the definition of the branching process, consider the function z x (t) that indicates 
whether individual x is alive at time t: 


z x (t) 


1 if a; is alive at time t 
0 otherwise 


Then the CMJ process {Z(t); t € M(j~} is defined as follows: 


Z(t) = ^2z x (t). 

x£l 


Convergence of CMJ processes 


Because of the self-similar structure of branching processes, renewal theory is often used to analyze 
them; we refer to EunaiiTj for detailed applications and proofs. Because we want to analyze the 
asymptotic behaviour of cherries in supercritical branching processes, it is important to state that 
such processes converge US], in the sense that: 


E[e~ Mt Z(t)} -> 


fo° e~ Mt E[zo(t)]dt 

fo° ’ 


where = E[£(t)] and M is the Malthusian parameter of the process, i.e. the positive real number 
that satisfies / 0 °° e~ Mt n(dt) = 1. 


The Jumping Chronological Contour Process 

Contour processes can be interpreted as “distance to the root” processes [12j, given a proper distance. 
We use the Jumping Chronological Contour Process (JCCP) defined in [HlElj. To avoid unecessary 
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heavy notation we give an informal description here. Consider a ball that visits every point on the 
tree starting from the death of the ancestor. The ball proceeds with speed -1 along the lifetime of the 
individual it is visiting, and when it encounters a birth event (of a descendant of that individual), it 
jumps to the point representing the death of the newborn. When the ball reaches the birth time of 
the individual it is visiting, it continues up the tree to visit the mother of that individual (and her 
descendants, and so on). The process is stopped when the ball reaches the point where the ancestor 
is born. Figure [2] shows the JCCP and its corresponding tree. The JCCP can be very useful because 
of the independence of its defining components (jumps and declines) for all trees with constant birth 
rates. 

Construction of the JCCP can be achieved independently from a previously defined tree through 
simulation of the jumps ji and the drops m (see Figure [2]). When constructing the JCCP two rules 
need to be applied: reflection and killing upon hitting 0. The first occurs when a jump overshoots the 
current time t and it is reflected or “sent back to” the current time. The second is used to to set an 
end for the process whenever it hits the x-axis. With these two properties, the JCCP has the same 
law as the tree [20] . 


3 Characteristic functions for shapes 


The shape S of a tree or a subtree can be defined as the tree or the subtree without the associated 
branch lengths PI- Common small (low number of tips) shapes are the cherry, the three-tip shape 
known as a pitchfork, the four-tip symmetric shape called a “double cherry”, and so on. If the shape 
S occurs inside a tree, it is clear that there is a unique individual which is both a tip and an ancestor 
of S, for each occurrence of S in the tree. If x is the ancestor of shape S we say that x mothers S. 


Definition 3.1: The characteristic of the shape S is 




1 if x mothers and is a tip of S 
0 otherwise 


(1) 


The total number of occurrences of shape S in the tree at time t is Z s {t ) = Ylxei fix W- For simplicity, 
if the suffix x is not specified for the characteristic, we imply the ancestor. Note that the characteristic 
may not be independent between individuals, but we require that it only depends on x’s life and on 
its daughter process. Henceforth we shall assume the following conditions: 

OO 

5^ sup {e~ Mt E[(j>{t)]) < oo, (2) 

k =0 fc - 4 - t+1 


E 


sup 4>(s) 


< oo, for all t < oo. 


( 3 ) 


Statements ([2]) and ^ were used in [23] to prove a number of convergence properties of supercritical 
trees. In general the number of occurrences of a particular configuration (shape) does not converge in 
a supercritical tree. However, given two shapes 5 1 and S 2 with characteristics cf) 1 and cfi 2 satisfying 
equations <m> and ([3]), the ratio of the numbers of these two shapes does converge. From [23] we have: 


Z s \t) , /*e-"^[^(t)]<tt . u , , 

w® d*’ m probablllty as ^ °°' 


( 4 ) 


In equation ([ 4 ]) the characteristics q i 1 and q i 2 are associated with the ancestor, i.e. (j) 1 := (J)q and 
1 t r := (j> q. This is to ensure that the time variable in the characteristic and in the integral are the 
same, without delays. 

The fact that the convergence expressed in equation Q is in probability implies that as the tree 
grows large, the ratio between the occurrences of shapes 5 1 and S 2 tends to its limit (as opposed 
to convergence in the mean). This holds for any tree from the process. The variance of the shape 
occurrence ratio calculated in a group of different trees with the same age t tends to zero as t —> 00 . 
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This property is very appealing when analyzing large trees, as the difference between the shape 
occurrence ratio and its limit may be considered statistically insignificant. 

In particular, if we are interested in the frequency of S (the number of occurrences of shape S 
per tip in the tree), we must divide the number of occurrences of S by the number of tips. A tip is 
also a (very simple) shape T defined by the characteristic (jp~(t) which equals 1 if t is larger than the 
birth time of x. It follows that E[(fP~(t)] = 1 for any t > 0. If the tip characteristic is used in the 
denominator of equation 0 - the denominator is 1/M and we find that the asymptotic frequency of 
S in a tree is 

roc 

M / e~ m E [0 5 (t)] df, (5) 

Jo 

with cf> s := 4 >q to ensure consistency between the time variables. 


4 Cherries 

4.1 Cherries in homogeneous processes 

Cherry characteristic 

At time t, a cherry C is the configuration made by an individual x and its last descendant (x,^ x (t)) 
when the latter has no descendants, i.e. €(x,£ x (t)){t) = 0 . It is called a cherry because it visually 
resembles a cherry (two tips of the tree joined at the same node). To be the ancestor of a cherry, x 
must have at least one daughter (i.e. ^(i) > 1 ), and x’s last daughter must have no descendants, i.e. 
= 0- Accordingly, the cherry characteristic can be written as follows: 




1 if £x{t) > 1 and £(*,&, (*))(*) = 0 
0 otherwise 


( 6 ) 


The total number of cherries in the tree is then: Z c (t) = Ylxei <$£(£)• 


Derivation of E [</> c (t)] 

In the homogeneous process, all individuals alive at a given time generate offspring at a constant rate 
j3 and die at a constant rate 5. In order to derive the asymptotic cherry frequency using equation ([5]) 
it is necessary to first derive E [</> c (i)]. For this purpose it is convenient to use the JCCP because in 
the homogeneous setting the JCCP is the stochastic process that has almost everywhere derivative -1, 
which jumps at rate /3 and whose jumps have random size exponentially distributed with parameter 
5 [21]; in addition the process is sent back to the current time t whenever it overshoots (reflection) 
and is killed upon hitting 0. Since a tree uniquely defines a JCCP and vice versa, then the law of the 
JCCP is also the law of the tree (modulo labelling of the tips). 

In order to carry out analysis using the JCCP we use u% for the i-th inter-jump time and ji-\ for 
the Ath jump size; the notation is depicted in Figures [2] and [3] We aim to find a relationship among 
the u and j variables in the JCCP that is equivalent to the cherry characteristic ([6]). Recall from 
section [ 3 ] that <J C (t) in Eq. ([5| is (£)• F° r this reason we will only focus on the part of the JCCP 
which corresponds to the ancestor and its last daughter: jo, u±, j 1 , 112- 

First note that the ancestor has at least 1 daughter if and only if the JCCP does not hit 0 before 
the jump j\. This ensures the presence of at least one daughter. Secondly, for the last daughter to 
have no descendants, the second low peak of the JCCP (the base of the third jump 72 ) has to be lower 
than the first (the base of the second jump j 1 ). 

Recall that the JCCP involves reflection, which means that when the JCCP overshoots, i.e. it 
jumps beyond the current time threshold t, the process is sent back to t. Because reflection may 
happen on either of the two jumps, there are four distinct cases that give rise to a cherry (see Figure[3]). 
Therefore, a cherry including the ancestor occurs only when the quartet (jo, u 1 , j 1 , u r 2 ) is in one, and 
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r u: r+++++ 


[fie / ‘ ,l ')du 2 dj l du 1 dj 0 


s:s'.rj:+ , +°++*) 


fie fu ‘]du 2 dj 1 du l dj 0 


Fig. 3 : Schematic for the calculation of E[(ft c (t)]. Because each of the two jumps jo and ji of 
the JCCP related to a cherry can overshoot above the current time t, four different 
cases have to be examined. The probability of each case can be derived with a 4- 
dimensional integral. In the computation, recall that the j t ~ Exp(5 ) and u 2 ~ Exp(/3), 
for i = 0,1, 2. 


one only, of these four subsets of 


»4 . 


A : 


' o < jo < t 

' 0 < jo < t 

' t < jo < oo 

0 < ui < j 0 

0 < u\ < j 0 

0 < ui < t 

, B : 

, C : { 

, D : < 

0 < ji <t-j o + m 

t-jo + m < ji < oo 

0 < Ji < u\ 

„ jl < U 2 < oo 

k t - jo + Ui < U2 < OO 

, jl < u 2 < oo 


0 < ui < t 
U\ < ji < oo 


To find the probability that the ancestor mothers a cherry, we are left to measure the four sets above. 
Recall that ji ~ Exp(<f) and Ui ~ Exp(/3), then d (ji) = 5e S]l dj t and d(ttj) = j3e~^ Ui dUi for i = 0,1, 2. 
We derive the probability of each set: 


A 


t rj o rt—j o+«i r°° 



0 JO 


L 


d(w 2 )d(ji)d(ui)d(j 0 ) = 


(35 


25 




((3 + 5) 2 2(3 + 5 


e~ st + 


2 5 2 


(P + 8) 2 

/35 2 




2(/3+5)i 


B 


C 



3 o roo !■ 

3t—jo+ui Jt- 


0 JO 


d(n 2 )d(ji)d(rii)d(j 0 ) = 


-e~ 5t - 


(2 (3 + 5)((3 + 5)i 
,~(J3+S)t , P 6 


—jo+Ul Jt—jo+Ul 


2/3 + 5 P + 5 
5 


+ 


3 —2(/3+<5)t 


(2/3 + 5)(P + 5) 


o Jo 


u i roo r 

4 '‘(U 2 )d(j,)d(» 1 )d 0 „) = ^e-« - — e 


~(P+S)t 


+ 


ps 


((3 + 5)(2p + 5) 


=>—2(/3+<5 )t 
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D 


roo rt roc roc 

: / / / / d(u 2 )d(ji)d(ui)d(j 0 ) 

7o </iti 


poo /*£ 7*00 /»oo 

/O Ju\ 

Now we only need to sum the four integrals: 

_ P6 , P „—St 2(35 


P 


2(3 + 5 


e~ 5t - 


P 


2/3 + 5 


3 —2(/3+(S)t 


E [< f(t )] = 


+ 


(P + 6) 2 2/3 + 5 


; e — 


(/3+<5)t 


/3 3 


(P + sy 


(2(3 + 6)(P + 5Y 


3 —2(/3+<5)t 


(7) 


Note that as 1 -> oo, E[c(t)\ converges to P(£(oo) > l)P(£(oo) = 0) which is the product of the 
probabilities of the two events that define the cherry characteristic. Moreover for t = 0 the terms 
cancel and E[c[t )] = 0; this confirms the impossibility to generate a cherry when no time has elapsed. 


The cherries to tips ratio in homogeneous processes 

By “cherries to tips ratio” or CTR we indicate the limit of the frequency of the cherry shape C in a 
tree. Using equation ([5]): 


z c (t) 

z r (t) 


CTR = M 


f 


e~ Mt E[(/) c (t)]dt. 


( 8 ) 


In homogeneous branching trees the Malthusian parameter is M = (3 — 5 and it is positive because we 
assumed the process to be supercritical. The mean number of offspring for each individual, or basic 
reproduction number, is Rq = P/5 which is always greater than 1 under the supercritical assump¬ 
tion. Substituting equation (7) we can compute the integral in equation (8) to obtain ^ 3l g + J)(/s-S) • 
Substitution of M and Rq gives 

CTR =3yh = 3 wrv (9) 

Note that in the limit P —> oo, that is the limit where the homogeneous tree tends to a Yule tree, the 
CTR tends to 1/3 which is a known result for Yule trees |26l 122] . The methodology used to derive 
the result in equation ([9]) can be applied to different shapes and different choices of the branching 
process (i.e. non-homogeneous). When dealing with non-homogeneous trees we should bear in mind 
that the JCCP maintains its most important property (independence of j t and Ui) only when there 
is a constant birth rate. So for this approach to apply, the non-homogeneity must come from a non¬ 
exponential lifespan rather than a non-constant birth rate. Otherwise, evaluation of E[(j) S } is likely to 
be challenging. 


4.2 Cherries in a non-homogeneous model 


Using the same approach as in section |4.1[ we found the cherry to tips ratio in a non-homogeneous 
model. If we choose the life-span distribution to be a Gamma distribution with rate 5 and shape 2, 
the Malthusian parameter and Rq are given by 


M — —(3 — 5 + — \/P 2 + 4 PS, i?o — 


2 P 
5 ' 
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Using equation ([ 5 ]) after evaluating E[(jf(t)] 


in this model, the cherry to tips ratio is: 


CTR = ^512 ^243f?g 2 + 243f?jf /:2 \fRo + 8 + 5103 R l 0 l + Al?>lRf ,2 yjRo + & + 40851 R^+ 

+ 26271 j Rq 9/2 v / ^o + 8 + 160767 R 9 0 + 80955i?J 7/2 ^/R 0 + 8 + 338148i?(j+ 

+ 1311847?q 5/2 a/ 7?o + 8 + 387448 R 7 0 + 112912i?o 3/2 ^R 0 + 8 + 2350727^+ 

+ 49152i?g 1/2 \/-Ro + 8 + 68784i?g + 936OR0 9/ V-Ro + 8 + 76167?^+ 

+ 5127?q / 2 y' / ifo~+~8 + 1287$) J x 


x ^27 ’7$ + 277?q /2 v / -Ro + 8 + 2977$ + 1897$ /2 ^/R 0 + 8 + 9007$ + 3607$ /2 ^/R 0 + 8+ 

+ 9687$ + 2407?q / 2 y/-Ro + 8 + 3687? 0 + 48^0 + 8 a/Ro + 32^) x 
x ^3i? 0 + \JRo + 8a/ Ro^j x ^7?o + \/Ro + 8\/7?o^ x ^5i?o + \/hri + 8 \/Ro T4^ 

( 10 ) 


5 Pitchforks and double-cherries 


Pitchfork characteristics 

A pitchfork V is a configuration with three tips (illustrated in Figure [4|. A pitchfork is formed when 
an individual x‘ last two daughters each have no descendants, or if her last daughter has only one 
descendant. The characteristic is written accordingly: 




1 

< 

0 


if 


&(*) > 2 and £(*,&(«))(*) = 0 and £( x , &(t )-i)(t) = 0 
or 


. £ x (t) > 1 and = 1 and C( x ^Rt),i)(t) = 0 

otherwise 


( 11 ) 


The pitchfork to tips ratio in homogeneous processes 

In order to derive the pitchfork to tips ratio (PTR) we use equation ([ 5 ]). To evaluate E[<j) P (t)\ we 


use the JCCP as in section 4.1 from the definition in equation 0 we determine the relationships 


among the first three peaks and three drops of the JCCP that correspond to a pitchfork in the tree. 
The resulting set in M 6 is formed of 16 components that are measured with 6-dimensional integrals 
using the software Maple jTJ. There are two cases when a pitchfork is formed: the first extends the 
cherry and occurs when the ancestor has at least two daughters and the last two of them have no 
descendants. The second case occurs when the ancestor has at least one daughter and the ancestor’s 
last daughters has one only daughter who does not have any further descendants. Because of possible 
overshooting, for each case there are 8 different sets that we should measure: 

io j? iir ,+n /” d(« 3 )do 2 )d(<. 2 )dOi)d(«i)do„) 


t no rt—jo+ui rjo—ui+ji r°° 


n no r 

Jo Jo J C 


0 


fi 


31 


roc 

Jt- 


r 00 

Jt-j 


j0+Ul~jl+U2 Jt-jo+lLl-jl+U2 


d(« 3 )d(j 2 )d(u 2 )d(ji)d(ui)d(jo) 


Jo Jo° Jt-jo+u 1 ft-jo+u 1 C JJJ d( W3 )d(j 2 )d(u 2 )d(ii)d( Wl )d(jo) 
Jo Jo° Jt-j 0 + Ul Jt-jo+m !Z Ju 2 d(n 3 )d(j 2 )d(n 2 )dO-i)d(ni)d(jo) 

r io J0 1 j;r i+h Jo ui ~ n+U2 dMd^dMd^dMdOo) 
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Jt°° Jo Jo" Jl Ul+n f™- jl+ u 2 LT-ji+tia d(« 3 )d0'2)d(u 2 )d(ji)d(«i)d0b) 

r /o c r d^do^d^jdo-od^jdcio) 

r jo Ju 7 ji c jz d(«3)d(j 2 )d(« 2 )do 1 )d(n 1 )d( JO ) 

JoC Jo~ JO+Ul Ji 1 Jt j0+Ul ~ jl+U2 r i0+Ul d(^3)d0 2 )d(n 2 )d( J1 )d(u 1 )dOo) 

Jo Jo° Jt J0+Ul Jo 1 Jt- jo+ul - jl+ u 2 Jt-jo+ Ul dK)d(j 2 )d(n 2 )d(j 1 )d(ui)d(jo) 

/o Jo° Jt-jo+u, Jt 3 ° +Ul Jo U2 Jt-u 2 +j 2 -jo+ Ul d(u 3 )d(j 2 )d(u 2 )d(j 1 )d(u 1 )d(j 0 ) 

Jo Jo° Jt-jo+u, Jo~ JO+Ul Ju 2 Jt-jo+u, d(u 3 )d(j 2 )d(w 2 )d(ii)d(ui)d(jo) 

Ji°° Jo Jo 1 Jo 1 Jo Ul - Jl+U2 Jj\-u 2 +j 2 d(«3)d(j 2 )d(« 2 )d(ii)d(ui)d(jo) 
r Jo Jo U1 Jo Jl j~- jl+ u 2 /“ d ( M 3 )d(j 2 )d(tt 2 )d(j 1 )d(u 1 )d(jo) 
r Jo Jo Ul r Ju 7- U2+J2 d(us)d( j2 )d(u 2 )d(n)d( Ul )d(^ 
r Jo J uT r /“ C d(« 3 )d(j 2 )d(n 2 )d(j 1 )d(u 1 )dOo) 

To evaluate the integrals, recall that ji ~ Exp(<5) and tq Exp(/3), then d(jj) = de^dji and d(tq) = 
Pe^^dui for i = 0,1,2,3. The expression for the pitchfork to tips ratio in the homogeneous model 
is: 

3 (p-6)((5 + 5)P 2 
(2/3 + 8)(p-6)(3/3 + 8)* 

3(Rp + 1)(-Rp) , . 

(2f? 0 + l)(3f?o + l) 2 ' 1 j 

Note that the limit of PTR as —i► oo, where the homogeneous model tends to the Yule model, is 

1/6, which corresponds to Rosenberg’s result for the Yule model in [26]. 


5.1 Frequency of the symmetric configuration with 4 tips in 
homogeneous processes 

The characteristic 

Let’s consider a configuration S of four tips organized in two cherries. In order to count such config¬ 
urations we define a characteristic which assigns 1 to every individual x which is both the ancestor 
and a tip of the configuration. This happens if among the daughters (at least 2) of x, the last has no 
descendants and the second last has one only descendant. Equivalently, the ancestor mothers a cherry 
and so does the ancestor’s second last daughter. 


4%{t) 


1 if €x(t) > 2 and £( x £ x (t))(t) = 0 and £( x £ x (t)- l)(*) = 1 and £(z,6Rt)-i,i) = 0 

0 o.w. 


(13) 
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Derivation of the asymptotic frequency 

We use equation ([5]) to evaluate the asymptotic frequency of S. As in the previous sections we evaluate 
E[(j> s (t)] using the JCCP. Its derivation involves 16 8-dimensional integrals: 


ft fj 0 ft-jo+Ui fjo—Ul+ji ft-jo+Ui-j!+U2 rh 

Jo Jo Jo jj 1 Jo Jo 


Jt j 0 +m n+u2 n+u 3 f™_ a3+h d (M 4 ) d (i 3 ) d (M 3 ) d (j 2 ) d (w 2 ) d (j 1 ) d (w 1 ) d (j 0 ) 


rt rjo rt-jo+u i 

Jo Jo Jo 


rjo-ui+ji 

'jl 


rt-j o+«i -ji+u2 

Jo 


fj 2 r°o 

0 Jt-jo+u 1 -ji+U2-j2+U3 


Jt-jo+ul-jl+U 2 


d(M4)d(j3)d(w 3 )d(j 2 )d(M2)d(ji)d(Mi)d(jo) 


rt fio ft-jo+ui 

Jo Jo Jo 


fjo— Ul+jl fOO 
j 71 Jt- 


ft-jo+Ul-jl+U2 


t-jo+ui-ji+u 2 JO 


Jo 3 J't°^u 3 +j 3 -j 0 +u 1 -j 1 +u 2 d(w4)d(j 3 )d(M3)d(j2)d(w 2 )d(ji)d(Mi)d(jo) 


Jo So sr° +ui f™~ ui+n S™ j0+U1 _ jl+U2 /o _JO+U1 “ J1+ “ 2 ft- jo+ul - jl+U2 d(M4)d(i3)d(w3)d(j2)d(w 2 )d(ii)d(wi)d(io) 

Jo JZ Jt-j 0 +u 1 /i-jo-Hil JJ 2 Jo 2 Jo U2 - J2+U3 Jh-u 3+j3 d(«4)d(j 3 )d(u 3 )d(j2)d(«2)d(ii)d(ui)dO'o) 

So Jo JZjo+m Jt-j 0 +m C JZ Jun—j 2 +u 3 C d(«4)d(i3)d(u 3 )dO'2)d(u2)d(ji)d(«i)dO'o) 

Jo Jo 0 Jt-j 0+Ul Jt-jo+u 1 JZ JJ 2 JZ JZ- U3+j3 d(«4)d(j3)d(n 3 )d0 2 )d(n 2 )d(ji)d(ui)dOo) 

/o JZ Jt-jo+u! Jt-jo+m JZ JJ 2 JZ JZ d(^4)d(j3)d(«3)d(j2)d(u 2 )d(ji)d(ni)d(jo) 

JZ Jo Jo" JZ Ul+n Jo Ul ~ Jl+U2 JZ fr jl+U2 ~ h+U3 JZ-u 3+j a d('U 4 )d(j 3 )d('U 3 )d(j 2 )d(’U 2 )d(j 1 )d(u 1 )d(jo) 
JT Jo Jo" 1 JjJ Ul+n JZ~ n+U2 JZ JZ-j 1+ u 2 - j2 +u 3 JZ-ji+u 2 d(«4)d(j 3 )d(n 3 )d(j 2 )d(n 2 )d(ji)d(«i)dOo) 
/t°° Jo JZ JZ U1+n JZ-ji+u 2 JZ~ n+U2 JZ JZ- jl+ u 2 -u 3+j3 d(«4)dO' 3 )d(« 3 )d(j 2 )d(« 2 )d(ji)d(«i)dO'o) 
JZ Jo JZ JjJ Ul+Jl JZ-ji+u 2 JZ~ n+U2 JZ JZ-ji+u 2 d(«4)d(j3)d(«3)d(j 2 )d(n 2 )d(ji)d(m)d(jo) 

Ji°° Jo JZ Ju 1 JT JZ / 0 “ 2_j2+ “ 3 /“_ U3+i3 d(u4)d(j3)d(n 3 )d(j2)d(u 2 )d(ji)d(ui)dOo) 

JZ Jo JZ Jl JZ JZ JZ-j 2 +u 3 JZ d(u 4 )d(j 3 )d(u 3 )d(j 2 )d(n 2 )d(j 1 )d(n 1 )d(jo) 

JZ Jo JZ Jl JZ JZ JZ JZu 3+33 -u 2 d(-4)d( J 3)d(n3)d0 2 )d(n 2 )d( J1 )d(n 1 )d( J o) 

/t°° Si JZ Jl JZ Jo 12 JZ JZ dK)d(j3)d(n 3 )d0 2 )d(n 2 )d(ji)d(ni)d(jo) 

To evaluate the integrals, recall that ji ~ Exp(<5) and tq rs_/ Exp(/3), then d(jj) = 6e s ^dji and d(tq) = 
f3e~P Ui dui for i = 0,1, 2, 3,4. The asymptotic frequency of S is given by: 

-|j-(2592i?o + 11556i?g + 182797$ + 13899+ 4799- 657$ - 5467$ - 1147$) 

(19440i?o + 910447$ + 187488^ + 2227417$ + 1681807$ + 8366677(5 + 274167?(j+ ( 14 ) 

+ 57057$ + 6847?o + 36) _1 

6 Conclusion 

In this paper we have presented a novel technique to compute the frequency of any shape configuration 
in a tree. These frequencies, namely the ratio of the number of occurrences of a specific shape 
configuration to the number of tips in the treee, converge in probability to the expression in Eq. ([5]) 
as t -» oo. We have applied the technique to evaluate the asymptotic frequency of cherries in the 
homogeneous process and in a non-homogeneous process (with non-exponential death/recovery rates). 
For the homogeneous tree, we have also derived the frequency of pitchforks and of the symmetric shape 
with four tips (double cherry). In Figure [4] we present a summary of the results. 
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Configuration 


Model 


As. frequency 


Plot and limit 


cherry 


Homogeneous 
birth rate: p 
death rate: 5 
R = P/5 


Rr 


3R 0 +1 



cherry 


Non homogeneous 
constant birth rate p 
life span distribution: 
Gamma (rate=5, 
shape=2) 

R = 2p/6 


(512 (243Rj 2 +243R“ ,2 V(«o+8)+5103R|' + 

+4131 Ro'V( +8)+40851 R‘" +26271R**' 2 v'( /*„+ 8)+ 

+160767R’+ 80955R' 0 7 '\/(R„+8|+3381 4 8 R *+ 

+ 131184Ri 5 ' 2 v(R 0 +8)+387448R’+112912Rj" 2 v'(R„+8)+ 
+235072R;+49152R, i 1 1 ' 2 V(R a +8)+68784R,'+ 
9360R0'"\/(R;+8)+7616Rj+512R’'\/(R;+8)+128R’)) 

((27Ri,+27Rj' 3 V(«„+8 )+2971^+l 89R J'V( R„+8|+ 

+900Rjj+360Rf V(^+8)+968R 2 +240RJ, 3 " 1 •,!(/?„+»)+ 
+368R 0 +48sqrt (R„+8 )f{Ra)+ 32)“' 

_ORn+x/fff^v TO)- 2 

(R 0 +i/(Ro+8)V(^))' 2 (SR 0 +V(A«+8)V(^)+4r 2 



pitch 

-fork 


Homogeneous 
birth rate: p 
death rate: 5 

R ,= P/6 


3R 0 -(i? 0 +l- 


(3R 0 +i; 


(2r„+i; 



5 7.5 10 


RO 


double 

cherry 


Homogeneous 
birth rate: p 
death rate: 5 
R 0 = p/5 


l/4( 2592RJ +11556R® +18279R’ + 
+ 13899 Rq+4799Rq— 65Rp — 

— 546Rq— 1 14Rq) 
(19440R0’+91044R0 ! + 

+ 187488R0 7 + 222741R0 6 + 

+ 168180R0 ! +83666R0 4 + 
+27416R0'+5705R0 2 + 
+684R0+36) 1 



Fig. 4: Summary of the analytical results. 
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